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(54) Method and apparatus for analyzing software executed in embedded systems 



(57) A software analysis system for capturing tags 
generated by tag statements in instrumented source 
code. The software analysis system includes a probe 
that monitors the address and data bus of the target 
system. When a tag statement is executed in the target 
system, a tag is written to a predetermined location in 
the address space of the target system. The tag con- 
tains a tag value that is indicative of the location in the 
source code of the tag statement generating the tag. By 



monitoring the predetermined address, the probe is 
able to capture tags as they are written on the data bus 
of the target , system. By properly instrumenting the 
source code, the software analysis system is able to 
perform a variety of analysis functions in essentially real 
time, Including code coverage, function and task execu- 
tion times, memory allocation, call pairs, and program 
tracing. 
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Description 

Technical Field 

This invention relates to software analysis, and 
more particularly to a method and apparatus for analyz- 
ing a wide variety of criteria of software executed on 
embedded systems. 

Background of the Invention 

Software is being written to control the operation of 
processors, including microprocessors, in a wide variety 
of fielda As software becomes more complex and 
lengthy, the probability of software errors or "bugs" 
increases. Furthermore, the difficulty of finding software 
bugs increases with this increased length and complex- 
ity of software. While bugs that prevent execution of the 
software will be apparent other types of bugs merely 
effect the performance or efficiency of the software with- 
out preventing its execution. Software bugs that merely 
effect the execution of the software may easily go unde- 
tected, thus indefinitely impairing the efficiency of the 
software. For example, software may allocate memory 
resources in an inefficient manner thus preventing the 
software from running at optimum speed. However, 
since the software continues to execute, the existence 
of these memory allocation errors will not be apparent 

A number of techniques have been developed to 
analyze the performance of software in an attempt to 
find software bugs, including software bugs that merely 
effect the performance of the software execution. One 
traditional technique is instrumented source code in 
which executable tag statements are inserted into vari- 
ous branches and locations of source code, thereby 
"instrumenting" the source code. After the source code 
has been compiled and linked, the tag statements are 
executed along with the code. As each tag statement is 
executed, it performs an operation that can be either 
detected by an analysis device or recorded for later 
examination. For example, each tag statement may 
write a value to a respective address so that the identity 
of the address containing that value provides an indica- 
tion of which tag statements were executed. As another 
example, each tag statement may prirrt tag identifying 
data to a disk file. As still another example, an array can 
be reserved in memory, with each array element corre- 
sponding to a tag inserted in a respective location in the 
source code. As each tag is executed, it sets a corre- 
sponding bit in the array. One approach to analyzing 
software with instrumented code is described in U.S. 
Patent No. 5,265,254 to Blasciak et ai. 

Using instrumented code, a wide variety of software 
parameters can be analyzed. Not only can instrumented 
source code, allow one to determine which branches 
have been executed, but it can also determine the exe- 
cution time of a branch or function by placing executable 
tag statements at the entry and exit points of the branch 
or function. When these tag statements are executed. 



they generate respective tags which are time stannped 
so that the elapsed time between executing the tag 
statements can be determined. 

Although instrumented code is often a useful tech- 

5 nique for analyzing the performance of software in a 
general purpose (/.e., "host") computer system, it is less 
suitable for analyzing the execution of software in an 
embedded system. An embedded system is a system 
whose primary purpose is to perform a specific function 

10 rather than to perform general connputationar functions. 
For example, a microprocessor-based microwave oven 
controller, a microprocessor-based automobile ignition 
system, and a microprocessor-based telephone switch- 
ing system are all embedded systems. Embedded sys- 

is tems do not lend themselves to instrumented code for 
several reasons. First, embedded systems generally do 
not have mass storage devices, such as disc storage, to 
store the result of tag statement executions. While the 
result of executing a tag statement can be stored in on- 

20 board random access memory, it is often difficult to 
externally retrieve such information. Furthermore, stor- 
ing the results of tag statement executions in system 
memory consumes system memory resources thus pre- 
venting the target from executing the software in a nor- 

25 mal manner. It is generally desirable to test the . 
performance of software in an embedded system under 
the same conditions that the software will normally run. 
Thus, an ideal software analysis technique would be 
"transparent" to the target system and thus have no 

30 effect on the manner in which the target system exe- 
cutes software. For these reasons, instrumented code is 
generally not suitable for analyzing software in an 
embedded system. 

In addition to software-based software analysis 

35 techniques {e.g., instrumented code), hardware-based 
techniques have been developed to analyze software 
executing in embedded systems. For example, logic 
probes have been placed on the address and data bus 
lines of microprocessors in an attempt to observe the 

40 execution of software in embedded systems. However, it 
is very difficult to monitor the execution of software 
using logic analyzers, and the lack of any data reduction 
on the output of the logic analyzer makes this technique 
very time-consuming. Furthermore, it is not always pos- 

45 sible to determine which instructions are being exe- 
cuted using the logic analyzer. For example, processors 
executing instructions from internal cache memory can- 
not be monitored using a logic probe because the exe- 
cution of these instructions is not reflected on externally 

50 accessible busses. 

Another hardware-based technique for analyzing 
the performance of software in embedded systems 
uses an emulator in connection with instrumerrted code. 
Basically, this technique uses an emulator to monitor 

55 the execution of tag statements thus eliminating the 
need to consume system memory resources and pro- 
viding a means to extract tag execution data. One 
example of this approach is described in U.S. Patent 
No. 4,914,659 to Erickson. As described in tine Erickson 
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patent, tag statements are inserted in the source code 
and executed in an emulator connected to the target 
system. Each of the tag statements writes a variable to 
a respective unique address. The emulator monitors the 
address bus of the emulated processor to detect 5 
addresses on the address bus corresponding to the 
respective tag statements. While the approach 
described in the Erickson patent does extract the tag 
execution data without consuming system resources; it- 
nevertheless suffers from a number of limitations. For 70 
example, by requiring that there be a unique address 
reserved for each tag statement, overlay memory tech- 
niques must be employed and a substantial amount of 
the target system's address space is consumed. 

Another hardware approach to analyzing software 15 
executing in an embedded system is described in U.S. 
Patent No. 4,937,740 to Aganwal et al. The Agarwal et 
al. patent discloses a software analysis system in which 
a hardware probe monitors the address bus of the tar- 
get system to capture addresses. The system disclosed 20 
in the Agarwal et al. patent Includes an internal tag gen- 
erata that -generates tags when respective addresses 
(up to 256) selected by the user are captured by the 
probe. Since the Agarwal et aL system does not use 
instrumented code techniques or otherwise correlate 25 
tags generated from the captured addresses with 
respective software locations, the Aganwal et al. system 
does not provide easy to use and understand informa- 
tion about the execution of the software. 

There is therefore a need for a method and appara- 30 
tus that can analyze the execution of software In an 
embedded system without the requirement that the 
embedded system have on-board data storage and/or 
output port capabilities In a manner that does not con- 
sume system resources, including memory, processor 35 
time and I/O resources, of the target system. 

Summary of the invention 

The inventive method and apparatus analyzes 40 
embedded software being executed in a target system 
having a data bus and an address bus. A plurality of 
executable tag statements are first inserted in the 
source code. Each of the tag statements, when exe- 
cuted, causes the target system to write a tag to a pre- 45 
determined location in the address space of the target 
system. The tags contain respective tag values so that, 
by the proper placement of tag statements in the source 
code, the tag values identify the respective locations in 
the source code of tag statements generating the tags, so 
After the source code has been instrumented, it is com- 
piled to obtain object code which then executes in the 
target system. During execution of the object code, the 
address bus of the target system is monitored to detect 
when the predetermined location in the address ^ace 55 
of the target system is being addressed. The data bus of 
the target system is also monitored to capture a tag on 
the data bus when addressing of the predetermined 
location is detected. Based on the respective tag values 



of the captured tags, the inventive method and appara- . 
tus is able to determine the source code locations that 
are being executed. 

The tags generated by respective tag statements 
can be of two types, i.e., control tags and data tags. 
Control tags include a data'f ield having a tag value cor- 
responding to the location in the source code of the tag 
statement generating the tag, as explained above. Data 
tags are always associated with a specific control tag, 
and they have a data field that provides information 
about an event identified by the control tag with which it 
is associated. Control tags may also have a tag type 
field that identifies, the analysis function for which the 
tag is used. 

The inventive method and apparatus is able to per- 
form a wide variety of software analysis functions. Per- 
formance analysis can be accomplished by recording 
first and second times when respective first and second 
tags are present on the data bus. The first and second 
tags have respective tag values corresponding to the 
location in the source code of first and second tag state- 
ments generating the first and second tags. Based on 
the difference between the first and second times, the 
time required to execute the software between the first 
ard second locations is determined. 

Memory allocation analysis can be accomplished 
by inserting control tag statements in the source code at 
a locations that will cause them to be executed along 
with memory allocation statements. An executable data 
tag statement Is also inserted along with each control 
tag to write a data tag to a second predetermined loca- 
tion in the address space of the target system. The data 
value of the data tag indicates the memory being allo- 
cated by the memory allocation statement. The inven- 
tive method and apparatus detects when the second 
predeterhnlned location in the address space of the tar- 
get system is being addressed, to capture data tags on 
the data bus. The memory allocation resulting from the 
memory allocation statements are then determined 
based on the data values of the captured data tag. 

Function linking can be analyzed by inserting tag 
statements in the source code at locations causing 
respective tag statements to be executed along function 
call statements. Based on the order in which the tags 
are captured when addressing of the predetermined 
location is detected, the inventive method and appara- 
tus determines which functions of the source code are 
linked to other functions of the source code. 

The inventive method and apparatus performs code 
coverage analysis by inserting tag respective state- 
ments in basic blocks of the source code so that the tag 
statements will be executed along with the basic blocks. 
Based on the tag values of the tags captured when 
addressing of the predetermined location is detected, 
the inventive method and apparatus determines which 
basic blocks of the source.code have be&^ executed. 
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Brief Description of the Drawings 

Rgure 1 is an isometric view of a preferred embod- 
iment of the inventive software analysis system. 

Rgure 2 is a schematic and block diagram of the s 
software analysis system of Figure 1 and its manner of 
use. 

Rgure 3 is a more detailed block of the software 
analysis system of Rgure T. 

Rgure 4 is a block diagram of the communications io 
and control circuit shown in the block diagram of Figure 

3. 

Rgure 5 is a block diagram of a data reduction proc- 
essor shown in the block diagram of Figure 3. 

Rgure 6 is a block diagram of a tag buffer shown in 75 
the block diagram of Rgure 3. Figure 7 is a block dia- 
gram of a tag preprocessor shown in the block diagram 
of Figure 3. 

Rgure 7 is a block diagram of a tag preprocessor 
shown in the block diagram of Figure 3. 20 

Rgure 8 is a screen display of the command win- 
dow for the software analysis system of Figure 1 . 

^ Rgure 9 is a screen display showing the results of 
two different types of software performance analysis 
performed by the software analysis system of Figure 1. 25 

Rgure 10 is a screen display showing the results of 
a memory allocation analysis performed by the softwai^e 
analysis system of Figure 1 . 

Rgure 11 is a screen display showing the results of 
a call linkage analysis performed by the software analy- 30 
sis system of Rgure 1 . 

Rgure 12 is a saeen display showing the results of 
a code coverage analysis performed by the software 
analysis system of Figure 1 . 

Rgure 1 3 is a screen display showing another pres- 35 
entation of the results of a code coverage analysis per- 
formed by the software analysis system of Figure 1 . 

Rgure 14 is a saeen display showing the results of 
a high level trace analysis performed by the software 
analysis system of Figure 1 . 40 

Rgure 15 is a saeen display showing the results of 
a more detailed trace analysis performed by the soft- 
ware analysis system of Figure 1 . 

. Rgure 1 6 is a screen display showing the results of • 
arKDther more detailed trace analysis performed by the 45 
software analysis system of Rgure 1 . 

Detailed Description of the Invention 

One embodiment of a software analysis system 10 so 
in accordance with the present invention is illustrated in 
Figure 1. The system 10 includes a probe tip 12 that 
clips onto the microprocessor of a target system (not 
shown) in a conventional manner. As a result, the exter- 
nal connector pins of the target system microprocessor. 55 
including its data bus and address bus, are accessible 
to the probe tip 12. The probe tip is connected through 
a conventional ribbon conductor 1 8 to a probe chassis 
20 containing most of the electronics for the system 10. 



The probe chassis 20 is, in turn, connected through a 
suitable cable 30. such as an Ethernet cable, to a host 
system 40. The host system 40 is essentially a conven- 
tional computer having a processor chassis 42 with a 
disk drive 44, a CRT monitor 46 with a display screen 
48, and a keyboard 50. The host system 40 preferably 
uses a Unix® or Windows® user interface and operat- 
ing systenri. Application specific software is loaded 
through the disk drive 44 to cause the host system 40 to 
properly interface with the probe chassis 20. receive 
appropriate configuriation and operating commands 
through the keylx>ard 50. and display analysis results, 
on ttie screen 48/ 

The use of the software analysis system 10 is illus- 
trated in Figure 2. Source code 60 written to run on a 
target system is first insfrumented by inserting tag state- 
ments 62 in the source code 1 0 at various locations that 
the user is interested in analyzing. For example, if the 
user is interested in determining code coverage, the 
user will insert a tag statement 62 in each branch of the 
source code 60. and the system 10 will determine which 
ot the branches have been executed based on whether 
each tag statement has been executed. Other analysis 
functions are described in detail below. The insertion of 
tag statement 62 in the source code 60 results in instiru- 
mented source code 64. When tiie instrumented code 
64 is produced, a symbol database 65 is also created 
which provides a record correlating each of the tag 
statements to their locations in the source code 10. The 
instrumented source code 64 is compiled in a conven- 
tional manner at 66 thereby resulting in executable code 
68. The executable code 68 is then loaded into the tar- 
get system T by any suitable means. For example, the 
executable code may be stored in a programmable 
read-only memory ("PROM^ that is installed in tiie tar- 
get system T The executable code 68 may also be exe- 
cuted in the target system T through a conventional 
emulator (not shown). Regardless of how the executa- 
ble code 68 is loaded into the target T. the target T is 
then allowed to execute the code. The probe tip 12 clips 
on to tiie target system T in a conventional manner to 
make electrical contact with at least the address bus 
and the data bus of the target system T Tags generated 
by tiie execution of tag statements 62 and collected by 
the probe tip are transferred to the probe chassis 20 
through the ribbon cable 18. After the probe chassis 20 
has performed various tabulation and data reduction 
functions on the data from tiie probe tip 12, it outputs 
appropriate data to the host system 40 through the local 
area network cable 30. Host application software 70 
includes processing routines 72 that store data in and 
retrieve data from data files 74, and tiie host application 
software 70 also includes a graphical user interface 75, 
such as the X-1 1 or Microsoft Windows® interface, that 
works with the processing routines 72 to operate on the 
data files 74 and provide various displays of analysis 
data. The processing routines 72 also receive the sym- 
bol database 65 so that the tag execution data in the 
data files 74 can be correlated with the location of the 
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tag statements in the source code 65 in order to provide 
reports and displays that specify performance in terms 
of source code locations and branches. The symbol 
database 65 is preferably loaded into the host through 
the disk drive 44 (Rgure 1). The. host application soft- 
ware 70 also includes data structure 76 for storing and 
handling the analysis data, and communications soft- 
ware 78 for providing communication with the target 
access probe 20. 

In operation, each of the tag statements 62 gener- 
ate a respective tag containing a data field having a "tag 
value" that is generally unique to the location of the tag 
statement in the source code 60. Thus, for example, a 
first branch may contain a tag statement having a tag 
value of 1 . A second branch may contain a tag state- 
ment having a tag value of 2, and so forth. When the tag 
statement 62 is executed by the target T a processor in 
the target T writes a tag containing the tag value to a 
predetermined location in the address space of the tar- 
get system T As explained in greater detail below, the 
tag 62 may also contain at least one other field provid- 
ing information about its function or location of its asso- 
ciated tag statement 62 in the source code 60, More 
specifically, the tag, statement 62 preferably writes a tag 
consisting of 32 bits which includes not only a data field 
word having atag value, but also a number of bits which 
define the type or category of tag. For example, different 
tag types may identify function entry and exit points, 
branch points, and memory allocation statements. Tags 
having a tag type field to identify the tag type are known 
as "control tags.- In the preferred embodiment of the 
system 10, all control tags are written to the same loca- 
tion in the address space of the target, tn addition to 
control tags, the system 10 also utilizes data tags. Data 
tags accompany control tags and are written to a sec- 
ond location in the address space of the target to pro- 
vide additional information relevant to a particular 
control tag. For example, a control tag may indicate that 
a memory allocation is taking place, and two data tags 
accompanying the control tag may indicate the size of 
the memory allocation and the memory pointer associ- 
ated with that allocation, .respectively. Since only a sin- 
gle location in the address space of the target system 
preferably is used for control tags and a relatively few 
locations used for data tags, the inventive system 10 
does not significantly use the memory resources of the 
target system, thus making the analysis system sub- 
stantially transparent to the target system. 

The probe tip 12 monitors the address bus and the 
data bus of the target T and determines when the proc- 
essor addresses the predetermined location(s) in the 
address space of the target system T The probe tip 12 
then captures the tag value currently on the data bus. 
As a result the currently captured tag value indicates 
the location in source code 60 cun'ently being executed. 
Moreover, the system 10 monitors the execution of the 
software in the target T in essentially real time since the 
probe 20 receives each of the tag values as it is cap- 
tured and performs various functions using the tag 



value. For example, for some software analysis func- 
tions, the probe 20 associates an execution time with 
. the tag value so that the execution time between a pair 
of tag statements can be determined. The probe chas- 

5 sis 20 may also perform various data reduction opera- • 
tions on the tag value, such as, for example, call pair 
analysis (i.e. , generating statistics on functions that are 
called by other functions) allocation. Basically, the sys- 
tem 10 is capable of determining function and task exe- 

70 cution times, coverage analysis, i.e., identifying portions 
of the source code executed or not executed, memory 
allocation analysis [i.e., identifying how much memory 
each allocation statement in the source code allocates 
and identifying specific allocation errors), and program 

j5 tracing (i.e., creating a sequential history of the execu- 
tion of the source code). Once again, the probe chassis 
20 performs these functions in essentially real time. 
Finally, the probe chassis 20 communicates with the 
host 40 to upload the data and allow it to be displayed 

20 by the host 

The software analysis system 10 of Figures 1 and 2 
is shown in greater detail in the block diagram of Figure 
3. With reference to Figure 3; the probe. tip 12 includes 
a conventional LCA commercially available from Xilinx 

25 that is programmed by information downloaded from the 
host 40 through the probe chassis 20 to monitor one or 
more predetermined addresses on the address bus. 
When the probe tip 12 detects that one of the predeterr 
mined addresses is active, it clocks the tag on the data 

30 bus into the probe tip 1 2. As the probe tip 1 2 must Inter- 
face with a specific microprocessor used by the target 
system T the probe tip is usually specific to the particu- 
lar microprocessor used by the target T. However, the 
probe tip 12 is the only target processor specific portion 

35 of the system 1 0. The probe tip 1 2 pr^erably also mon- 
itors the status bus of the probe tip 12 so that it can 
detect a write function to one of the predetermined 
addresses. 

When the probe tip 12 captures each tag, it passes 

40 the tag to a tag preprocessor 1 00 which also receives a 
time stamp from a time stamp generator 102. The tag 
preprocessor 100 pairs the current time stamp value 
from the time stamp generator 102 with the tag values 
received from the probe tip 12. It also determines where 

45 the time stamped tag values are to be routed based on 
the tag type. As explained above, the tag type Is defined 
by the value in the tag type field in the tag received from 
the probejip 12. More specifically, if the tag is a cover- 
age analysis tag generated by a tag statement placed in 

50 a branch of the source code to determine if the branch 
is executed, the tag is passed directly to a code cover- 
age data reduction processor and database 1 10. All tag 
types other than coverage analysis tags are passed to a 
tag buffer 1 12. It Is desirable to process the code cover- 

55 age tags separately from the other tags because cover- 
age tags are generally far more frequent than other 
types of tags. The tag preprocessor 100 also preferably 
performs some qualification on the tags before passing 
them to the tag buffer 112 or code coverage data reduc- 
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tion processor and database 110. More specifically, the . 
tag preprocessor 100 preferably passes only the tags 
for the measurement being performed to minimize the 
number of tags that must be processed and thereby 
maximize the speed of downstream circuitry. The tag 
preprocessor 100 is preferably implemented using a 
conventional LCA commercially available from Xilinx . 
that is programmed by information downloaded from the 
host 40 through the probe chassis 20 to perform the 
functions described above. 

The code coverage data reduction processor and 
datat>ase 1 10 is preferably a hard-wired logic circuit, but 
it may also be implemented using a microprocessor and 
associated circuitry. The code coverage data reduction 
processor and database 110 converts captured code 
coverage tags to indices in a code coverage data base 
array. Each bit in the array represents a single tag value 
corresponding to the location in source code 60 in which 
the corresponding tag statements were inserted at 62 
(Figure 2). Thus, the contents of the array, which may be 
downloaded to the host 40, provides an indication of all 
instrumented branches of the source code that have 
been executed: 

The tag- buffer- 1 12 is a high speed buffer that tem- 
porarily stores the tags received from the tag preproces- 
sor 100. The tags are then passed on to a data 
reduction processor 114. The tag buffer 112 is used to 
accommodate tags received in bursts at a much faster 
rate than can be handled by the data reduction proces- 
sor 114. The tag buffer 112 can accohirhodate high ' 
speed bursts of tags from the tag preprocessor 100 as 
long as the average rate of tags passed by the tag pre- 
processor 100 does not exceed the processing rate of 
the data reduction processor 114. 

The communications and control circuit 120 is illus- 
trated in greater detail in Rgure 4. The interface 
between the probe chassis 20 and the host 40 consists 
of a standard Ethernet communication channel. The 
Ethernet transmission status signals are routed through 
a communications port 130 to a status port 132. The 
communications port 130 is preferably implemented 
with a Motorola MC68340 control processor. 

As explained in greater detail below, a controlproc- 
essor 134 handles commands from the host software 
and initialization of the probe chassis 20, The control 
processor 134 also has direct access to the communi- 
cations port 130 and a control memory 136. The control 
processor 130 is preferably an MC68340 microproces- 
sor. The control memory 136 stores the instructions for 
the control processor 1 34 software as well as data stor- 
age for the control processor 134. The control memory 
136 is preferably non-volatile memory, such as flash 
memory for code storage and DRAM for data storage. 
As explained in greater detail below, the control proces- 
sor 134 has dual port access to the database memory 
118 and database 110 to transfer data to the control 
memory 136. 

The data reduction processor 114 is illustrated in 
greater detail in Figure 5. The data reduction processor 



114 includes a data reduction microprocessor 1 40 hav- 
ing a data bus 142, an address bus 144 and a control 
and status bus 1 46 connected to the data base memory 
118 (Rgure 3). The data reduction microprocessor 140 

.5 is also connected to data and code storage memory 
150. the tag buffer 112 and an I/O port 160 through 
these busses 142, 144, and 146. The data reduction 
microprocessor 140 processes tags from the tag buffer 
112 (Figure 3), as explained above, under control of 

10 instructions from the code storage memory 150. The 
data reduction microprocessor^140 also communicates 
with the control processor 134 (Figure 4) using the I/O 
port 160, and a decoder 132. The control processor 
accesses data in the data base memory 118 through 

75 the I/O port 1 60 under the control of the DMA and inter- 
rupt channels of the data reduction microprocessor 1 40. 
The DMA channel of the data reduction nrucroprocessor 
1 40 transfers data to or from the data base memory 1 1 8 
and to or from the I/O port 160 each time the control 

20 processor 134 reads from or writes to the I/O port 160. 
This provides the control processor 134 dual port 
access to the data base memory 1 18. As a result, rela- 
tively inexpensive DRAM may be used in the data base 
memory 1 18 as dual ported memory between the data 

25 reduction microprocessor 1 40 and the control processor 
134. Furthermore, the control processor 134, which is 
relatively slow, is able to effectively access the data 
base memory 1 18 using only a single bus cycle of the 
data reduction microprocessor 140 and minimizing the 

30 delay to the data reduction calculations. - 

The data reduction processor 1 14 performs most of 
the functions in the probe chassis 20. The data reduc- 
tion processor 114 processes tags from the tag buffer 
112 and stores resulting data in structured form in a 

35 database memory 1 1 8 for various types of performance 
analysis such as memory allocation, execution time, 
real time trace, etc. Thus, the database memory 118 
stores data resulting from the capture of all of the tags 
other than code coverage tags. By extracting and saving 

40 pertinent data from the tags and then discarding the 
tags, the required capacity of the database memory 1 1 8 
can be relatively small. Also, the required memory 
capacity is dependent only on the number of functions 
or task instances being monitored and not the number 

45 of tags received from the tag buffer 1 12. As a result of 
the database structure (i.e., the size of the database is 
proportional to the number of events monitored rather 
than the number of occurrences of such events), analy- 
sis of a software program can run for an indefinite 

50 period of time to be sure that the software is adequately 
tested and yet no data is missed.- /.a. the measurement 
is non-sampled. 

In order for the data reduction processor 114 to 
make meaningful measurements of an embedded soft- 

55 ware program, it must track the software execution con- 
text- Since most modem embedded programs use some 
kind of real-time operating system ("RTOS"), this means 
that the data reduction processor 1 14 must be aware of 
the RTOS execution context. 
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Three events which are controlled by the RTOS 
must be tracked: when a task is created, when a task is 
deleted, and when a task switch (swap) occurs. In order 
to accomplish this, a second instrumentation step 
(beyond application program source instrumentation) is 5 
required. Most modern commercial RTOS provide call 
outs which conveniently allow a user supplied software 
function to execute when a specific RTOS event occurs. 
A simple function linked into the appropriate call outs for 
the above three RTOS events outputs the appropriate w 
control tag to indicate the kind of RTOS event, and one 
or more data tags to uniquely identify the RTOS task(s) 
affected. In a similar fashion, custom-built RTOS may be 
easily modified to emit the appropriate tag as well. 

The data reduction processor 1 1 4 takes a different is 
action depending upon which RTOS tags are received. 
When a "task create" tag is received, the data reduction 
processor .114 establishes in memory a stack area for 
the task When a "task delete" tag is received, the data 
reduction processor. 1 1 4 deletes the stack after tabulat- 20 
ing any remaining measurement results into the appro- 
priate data base. When a "task switch" tag is received, 
the data reduction processor 114 suspends any meas- 
urement activity for the current task stack, and switches 
to another stack which corresponds to the task ID 25 
received (as a data tag). 

The data reduction processor 114 also tracks con- 
text at the function level within each task using tags 
emitted at each function entry and exit point. When a 
switch to a task occurs, the data reduction processor 30 
114 will receive a function entry tag from the first func- 
tion in the task, and will record the entry on the stack 
{e.g. function "A"). If a second function ("8**) entry tag is 
received prior to the exit tag for function A, function B's 
entry tag is recorded on the stack, and the data reduc- 35 
tion processor 114 "knows" that a function nesting has 
occurred, i.e. A has called B. For performance meas- 
urement purposes, the time stamp corresponding to 
each tag is recorded on the stack as well. 

When a context change occurs such as a task swap 40 
{e.g.; from task "Y" to task "Z"), the current time is 
recorded on Y's stack such that no further execution 
time will be attributed to it while the program executes 
other tasks. The data reduction processor 114 then 
switches to the stack corresponding to task Z and 45 
begins tracking time for each tag emitted while execut- 
ing task Z. Should the RTOS swap back to task Y, the 
times and function nesting of task Z are "frozen", as 
described for task Y above. The data reduction proces- 
sor 114 then points back to Y's stack, and the appropri- so 
ate timers resume counting time where they left off. 
Since the function hierarchy context of task Y has been 
preserved on Y's stack, the system is able to accurately 
track, the continuation of task Y's activity When a 
"delete task'* tag is received, any execution information 55 
preserved on the task's stack is tabulated a final time in 
the appropriate data base. 

This context tracking method enables many sophis- 
ticated qualifications of program measurements based 



upon software execution context. Performance meas- 
urements may be qualified such that function execution 
time is tracked only when the program is executing a 
particular task, thereby eliminating executions from a 
different context of functions shared between two or 
more tasks. While performance measurements have 
been described as a typical example, , other measure- 
ment qualifications are equally possible and desirable. 
For example, a trace history measurement can also be 
qualified by the software context such that tags will only 
be stored In the trace buffer when executing in a partic- 
ular task, or a particular function nesting hierarchy. 
Memory allocation could be tracked only when the pro- 
gram is executing in a particular task context, etc. 

The data reduction processor 1 1 4 performs call pair 
measurements by tracking which functions called other 
functions by identifying consecutive function entry tags 
generated by respective tag statements in the source 
code for the calling and called functions. The data 
reduction processor 1-14 updates this information each 
time a new function entry tag is received. The resulting 
data can be stored as either a count of executions of 
each call pair or a flag indicating at least one execution 
of each call pair. 

Rnally, the data reduction processor 114 performs 
memory allocation measurements based on receiving 
from the tag buffer 1 12 memory allocation tags gener- 
ated by tag statements inserted into allocation state- 
ments in the source code. These memory allocation 
tags (including control tags and data tags) indicate how 
much memory was allocated or freed by each call to a 
memory allocation function. 

The design goal for memory allocation tagging is to 
record successful allocations and deallocations, includ- 
ing the original allocation size and site (caller identifier). 
and allocation errors, including block overwrites, block 
underwrites and heap corruption (/.e. writes out of 
bounds references), writes to deallocated blocks, and 
enroneous arguments to interface routines (e.g. wild 
pointers). 

Implementing memory allocation tagging includes 
an error-checking memory allocator and an instru- 
mented interface to it, a set of instrumentation rules for 
modifying user code, and a set of replacements for the 
standard memory allocation routines. The error-check- 
ing memory allocator is based on a straight forward 
heap-based memory allocator. The interface to the allo- 
cator is based on the standard memory allocation rou- 
tines, augmented with the addition of a memory 
management tag (e.g. augmented-maltoc). The tag 
encodes the kind of the memory [de]allocation call (e^g. 
malloc, realloc, free, etc.), and the caller identifier. Infor- 
mation about each allocation is kept, including the 
requested size and the caller identifier of allocation site; 
for later reference when the block is deallocated, or an 
error is discovered in the block. 

When a block is successfully allocated, a data and 
control tag are written to announce the allocation, 
including the size for the allocated block (a data tag), 
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and the kind and caller identifier of the aJlocation (a'con- 
trol tag). When a block is successfully deallocated, a 
data and control tag are written similar to that for a suc- 
cessful allocation, including the size for the allocated ' 
block, the kind of the deallocation, and the caller identi- 5 ' 
fler of the allocation. 

The base allocator Is augmented with error check- 
ing, including verification of the arguments to the alloca- 
tion and deallocation routines, the integrity of each block 
present in the heap, whether currently allocated or 10 
freed, and the integrity of the heap as a whole. When an 
error is identified, a set of data and a control tag are 
written to indicate the error. The information present in 
the tags include an error identifier, the address of the 
block in error and its size (if any), the caller identifier(s) i5 
of the block's allocator and deallocator (if any), and the 
kind of allocator call begin attempted when the error 
was discovered. 
- Instrumented C code, which calls the standard 
memory allocation routines, is changed to replace the 20 
original calls with calls to the corresponding instru- 
mented interface, which allows for the addition of a 
memory management tag, as described above. Unin- 
strumented C code, which calls the standard memory 
allocation routines (e.g. precompiled libraries), is pro- 25 
vided for by.a set of routines with the same signature as 
the standard routines, but which call the corresponding 
instrumented interface, and pass an "unknown" caller 
identifier 

In addition to the provisions made for C code as 30 
described above, instrumented C++ code must also 
handle the use of the global versions of operators new 
and delete. 

For instrumented C++ code which calls the default 
operator new, a file local definition is supplied, using 35 
placement syntax, which augments the standard opera- 
tor new signature with a memory management tag argu- 
ment. Uses of the default operator new are replaced 
with calls to the augmented version, whose definition 
simply calls the instrumented interface to the allocator 40 
(i.e. augmented-malloc). For uninstrumented C++ code, 
a default version of global operator new is provided 
which calls augmented-malloc with an "unknown" caller 
id. 

For instrumented C++ code which calls the default 45 
operator delete, a file local definition of the default 
delete operator is provided which does nothing. Calls 
the (now, do nothing) operator delete are followed by a 
call to the instrumented interface (ie. augment ed-free), 
along with an appropriate memory management tag. so 
For uninstrumented C++ code, a default version of glo- 
bal operator new, is provided which calls augmented- 
f ree with an "unknown" caller id. 

Returning to Figure 3, the probe chassis 20 com- 
municates with the host 40 through a communications 55 
and control circuit 120. Under command of the host 
processor 40, the communications and control circuit 
120 can directly access data stored in the database 
memory 1 18 or the code coverage data reduction proc- 



essor and database 1 1 0 so that such data can be trans- 
ferred to the host 40 for further processing and display 
The communications and control circuit 120 also routes 
cornmands from the host 40 to the probe chassis 20 to 
select the mode of probe operation, including specifying 
the function to be performed and the tag types to be col- 
lected. 

The tag buffer 112 (Rgure 3) is shown in Figure 6 
along with its interface to the data reduction processor 
1 14 ■ As explained above, tags are often captured by the 
probe tip 12 in bursts at rates that exceed the maximum 
processing rate of the miaoprocessor 140. One appar- 
ent solution to averaging the tag capture rate is to use a 
first-in first-out ("FIFO") buffer. However. FIFO buffers 
capable of operating at high rates of speed having suffi- 
cient capacity to store large numbers of tags are rela- 
tively expensive. The tag buffer 1 1 2 illustrated in Rgure 
6 is able to effectively implement a large capacity, high 
speed FIFO buffer using a high speed, low capacity 
FIFO buffer 1 70 of conventional design. The FIFO buffer 
1 70 normally receives tags from the tag preprocessor 
100 (Rgure 3) and sequentially outputs those tags to 
the microprocessor 140. The microprocessor 140 then 
stores the tags in the DRAM 150 while awaiting data 
reduction arid processing. However, in the event that the 
relatively low capacity FIFO buffer 170 becomes filled, it 
outputs a bit to the direct memory access ("DMA") input 
of the microprocessor 140. The microprocessor 140 
then allows the FIFO buffer 1 70 to write data directly to 
the DRAM 150, thereby speeding up the writing of data 
in the DRAM 150- 

As mentioned above, the tag preprocessor 100 
combines the tags received from the probe tip 12 with a 
time stamp received from the time stamp generator 1 02 
and routes them to either the data reduction processor 
1 14 or the code coverage data reduction processor and 
database 110. The tag preprocessor 100 is shown in 
greater detail in Figure 7. A clock and control circuit 180 
interfaces with the time stamp generator 102 (Figure 3), 
a clock signal received from the probe tip 12 and control 
bits from the data reduction processor 114. The clock 
and control circuit 180 then controls the operation of 
other components in the tag preprocessor 100. The tag 
preprocessor 100 includes a probe tip latch 182 that, 
when triggered by the clock and control circuit 180, 
latches into the tag preprocessor 100 the tag type field 
and the tag value. Based on the tag type, a code cover- 
age tag splitter 184 routes the tag to either the code 
coverage "data reduction processor and database 110 
(Figure 3) via bus 1 88 or to a tag multiplexer 1 90 via bus 
192. The tag preprocessor 100 also includes an internal 
tag generator 194 that can apply an internal tag to the 
tag multiplexer 190. The data reduction processor 114 
controls the tag multiplexer 190 to apply either the inter- 
nal tag on bus 196 or the tag from the probe tip 12 on 
bus 192 to the tag buffer 112. Finally, a synch latch 198 
latches in the time stamp at the appropriate time under 
control of the clock and control circuit 180 so that the 
time stamp is synchronized to the currently captured 
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tag? 

The user interface for the host system 40 is best 
illustrated With reference to the user Interface command 
bar shown in Figure 8. During the operation of the soft- 
ware anal^is system 10, the display screen 48 of the 
monitor 46 (Rgure 1) displays a title bar 230 at the 
upper portion of the screen. A command bar 232 for 
entering commands into the system 10 is positioned 
below the title bar 230. Rnally. a tool bar 234 adapted to 
allow direct entry of commands available in the com- 
mand bar 232 is positioned beneath the command bar 
232. Most of the file commands available in the com- 
mand bar 232 may be directly selected by clicking on 
appropriate icons of the tool bar 234 using a pointing 
device, such as a mouse. A new file icon 240 causes the 
system to save unsaved data, closes any open views on 
the screen and invokes a configuration dialog to allow 
configuring for a hew task. An 'open' icon 242 invokes a 
dialog for loading and displaying analysis results saved 
from a prior test A "save icon 244 invokes a file save dia- 
log to save analysis data resulting from a test. The save 
command presumes that the data has already been 
given a file' name. If not, the file save dialog requests the 
user to enter a file name under which file data is saved. 
A "print" icon 246 invokes a print dialog which allows the 
software analysis system to print reports showing anal- 
ysis data or subsets of data. A print preview icon 248 
allows the viewer to view on the screen how the printed 
document will appear. The user can exit the Windows 
software by either double-clicking on ah exit bair 250 or 
selecting "exit" as a file command in the cornmand bar 
232. 

The edit command in the command bar 232 con- 
sists of a single command, namely, a "copy" command. 
This command, which can be entered by selecting a 
"copy" icon 260 in the tool bar 234 copies selected data 
into a clipboard (i.e., tennporary storage) so it can be 
pasted into another application such as a spreadsheet 
program. 

Several run commands available from the com- 
mand bar 232 may also be entered through the toot bar 
234. A "run" Icon 270 erases any previously acquired 
data and begins the acquisition of data from the probe 
12 while performing an analysis function. A "halt" icon 
272 halts data acquisition from the probe until a resume 
Icon 274 is selected. There are a large number of data 
commands that can be selected from the command bar 
232 or from the tool bar 234. A "sort ascending" Icon 
280 sorts in an ascending order active data acquired 
from an analysis by values In the selected column. Sim- 
ilarly, selecting a "sort descending" icon 282 causes the 
acquired data to be sorted in a descending order 
Selecting a "sort multiple" icon 284 invokes a sort dialog 
for setting up a multi-level sort. 

An "edit filter" icon 286 Invokes a filter dialog for set- 
ting up a data filter for an active view. RItering a display 
causes only selected measurement results to be dis- 
played, i.e. , only the functions of Interest. An "apply cur- 
rent filter" icon 288 causes the system to apply a 



previously specified filter to the active data view. A 
"show all" icon 290 removes the data filter so that all of 
the acquired data is displayed in the active view. A "find" 
icon 292 invokes a find dialog for setting up a search 

5 within an active view. 

A variety of data commands can also be entered 
through the command bar 232 or directly through the 
tool bar 234. A "function performance* icon 300 is 
selected to invoke a function performance table to dis- 

10 play function performance data that has been acquired 
from the probe or loaded from a file stored form a previ- 
ous analysis, A .task performance" commarxl can be 
selected from the view menu in the command bar 232, 
but there is no corresponding icon In the toolbar A "lask 

15 performance" command displays previously acquired 
task performance data from either the probe or a file. A 
"call linkage" performance icon 302 Invokes a call link- 
age table to display call pair data from the probe or from 
a file of call pair data acquired In a previous test. A 

20 *TDranch coverage" Icon 304 is selected to invoke a 
branch coverage table to display coverage data from the 
probe or from a file saved from a previous test. A cover- 
age summary graph icon 306 Invokes a coverage sum- 
mary graph to display a statistical record of coverage 

25 data from the probe or from a file stored from a previous 
analysis. A "memory allocation" icon 308 Is selected to 
invoke a memory allocation table to display memory 
allocation data acquired from the probe or from a file 
saved from a previous test. Finally, a "trace analysis" 

30 icon 310 invokes a trace view In the display window to 
display trace data acquired from the probe or f rorh a file 
saved from a previous test. 

The command bar 232 also allows standard Win- 
dows® commands, such as hiding or showing the tool 

35 bar 234. cascading or tiling open view windows, arrang- 
ing icons, etc. The tool bar 234 also includes an index 
icon 250 to invoke a top level contents page for on-line 
help in operating the system 10 and a second "help" 
icon 232 which may be "dragged" and "dropped" to any 

40 item on the display to obtain help about that Item. Thus, 
the Windows® user interface allows the software analy- 
sis system to be easily and quicWy operated by rela- 
tively inexperienced personnel. A similar user interface 
running on UNIX® workstations utilizing, a X-11 win- 

45 dowing system provides similar ease and speed of use. 
Examples of performance analysis displays are 
illustrated In Rgure 9 for both task performance and 
function performance. The function perfornriance display 
310 includes a first column 312 listing various functions 

50 performed by the source code followed by a column 3 1 4 
showing the number of times each of those functions 
was executed. Time columns 316, 318, 320 then show 
the minimum, maximum, and average time, respec- 
tively, required to execute each of the functions listed in 

55 the column 312. The cumulative time spent executing 
each of the functions (/.e., the product of, the number of 
executions and the average) is then displayed In column 
322. Finallyj column 324 displays the percentage of time 
that each of the functions listed in the first column 312 
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were being executed. The data in column 324 can be 
calculated as the ratio of each entry in column 322 to 
the sum of the entries in column 322. 

A task performance analysis display screen 330 is 
similar to the function performance analysis display 
screen 310 and, in the interest of brevity, its explanation 
will not be repeated. The performance analysis ratios- 
shown in column 324 can also be displayed as a bar 
graph histogram. 

As explained above, the software analysis system 
1 0 can also perform a dynamic analysis of memory allo- 
cation, and an example of the display of data from such 
analysis Is shown in Figure 10. A memory allocation 
screen 350 includes a first column 352 listing each of 
the functions containing a memory allocation statement. 
A second column 354 lists the source file for each of 
those functions. The next column 356 lists the number 
of times each of those functions were executed and the 
next three columns 358, 360, 362 lists the snnallest 
memory allocation, the largest memory allocation and 
the average memory allocation, respectively. The final 
column 364 contains a bar graph and a digital display of' 
the memory bytes currently allocated. By viewing the 
'bar graph in column 364, the operator can examine in 
essentially real time the allocation of mertiory in the tar- 
get system as the software is being executed. Appear- 
iiig with the memory allocation display 330 is a memory 
error display 370 that lists each of the memory errors 
found during the memory allocation analysis. 

An example of a call linkage table resulting from a 
call pair analysis is shown in Figure 11. A call linkage 
display 380 dynamically tracks a nunnber of function 
linkages by listing in a first column 382 the calling func- 
tions and in a second column 384 called functions. The 
number of times each of the calling functions has called 
the called function is then listed in a tiiird column 386 in 
both digital and bar graph form. 

As explained above, the source code can be instru- 
mented by placing a tag statement in each branch to 
assess call coverage, i.e., the number of branches exe- 
cuted and the frequency of execution of each branch. 
An example of a code coverage display 400 Is illustrated 
h- Figure 12. The code coverage display 400 includes 
a bar graph 402 showing the overall level of coverage 
achieved during a test. Functions are categorized in 
percentile ranges along the vertical axis, and the 
number of functions that fall within each range grouping 
is indicated on the horizontal axis. The total number of 
functions and tasks not executed are listed at the bot- 
tom of the display at 404. This listing 404 can alert the 
operator to portions of the software that are apparentiy 
not being executed. An alternative code coverage dis- 
play 408 consists of a line graph 410 depicting the per- 
centage of coverage achieved over the period of time 
conducting the test, as illustrated in Figure 13, The code 
coverage graph 410 of Figure 13 shows that 15% of the 
code was executed during tiie first minute, the rate of 
code coverage increased only marginally for the next 
two minutes, and the rate of code coverage then 



increased at a much faster pace for the next two min- 
utes until leveling off at 40% coverage. 

TTie trace function as described above can be dis- 
played in at least three different modes. A high level 

5 trace display 420 shown in Rgure 14 is preferably tiie 
" default view upon entry in the trace mode. The display 
420 contains a time ordered list of nested function entry 
. and exit points and RTOS task events. The display 
includes a column 422 showing the source file for the 

10 software, a column 424 showing tiie functions in tiie 
order that they are executed, a column 426 showing a 
line number of that function, and a column 428 desig- 
nating whether the traced function was an entry or exit 
point. A relative time stamp for each function Is listed in 

15 a right-hand column 430. Alternatively, the results of a 
trace can be displayed in a control flow display 440 
shown in Rgure 15. A control flow display shows time- 
ordered listing of ail function points, executed branches 
and real time operating system events in the trace 

20 buffer. As with the high level display 420. the control flow 
display displays the source file In a first column 442, the 
tasks, functions, and branch points in the order that they 
are executed in a third colurrin 444, whether the function 
is an exit point, an entry point, or a branch in column 

25 446, and the line number of the poirrt in column 448, As 
before, the right hand column 450* lists a relative time 
stamp for each point. Finally, the results of a trace can 
be displayed in a source view display 460 shown in Fig- 
ure 16. A source view display shows every line of exe- 

30 cuted software, although loops can be expanded or 
collapsed. The display 460 interpolates source lines 
which, by inference, were executed. This determination 
of execution is made by retrieving those source code 
lines which comprise the basic block in which each 

35 branch tag is located. Function entries and exits, 
branches, RTOS events, and other executed lines of 
software of interest preferably may be cdor coded. As 
with the other trace displays shown in Figures 14 and 
15, the source view display 460 displays tiie source file 

40 in a first column 462, the functions in the order that they 
are executed in a third column 464, whether the function 
is an exit point, an entry point, or a branch in column 
466, the line number of the point in column 468, and a 
relative time stamp for each point in column 470. 

45 It will be apparent to one skilled in the art that the 
various analysis functions that tiie software analysis 
system 10 is capable of displaying can be presented in 
displays other than shown in Figures 9-15. Further- 
more, from the foregoing it will be appreciated that, 

50 although specific embodiments of the invention have 
been described herein for purposes of illustration, vari- 
ous modifications may be made without deviating from 
the spirit and scope of the invention. Accordingly, the 
invention is not limited except as by the appended 

55 claims. 

Claims 

1 . A system for analyzing software being executed in a 
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target system having a . data bus and an address 
bus, said software containing a plurality of executa- 
ble tag- statements which, when executed, cause 
said target system to write at least one tag to 
respective predetermined locations in the address 5 
space of said target system, said tags containing 
respective tag values con^esponding the locations 
in said software of tag statements generating said 
tags, said system comprising: 

10 

a probe connected to the address and data 
busses of said target system while said target . 
system is executing said software, said probe 
detecting when said predetermined location in 
the address space of said target system is is 
being addressed, said probe capturing a tag on 
the data bus of said target system when said 
probe detects that said predetermined location 
has been addressed; and 

a processor connected to said probe, said 20 
processor determining the software locations 
that have been executed based on the respec- 
tive tag values of said captured tags. 

2. The system of claim 1 further comprising: 25 

a time tag generator connected to said probe, 
said' time tag generator recording respective 
times when said probe captures said tags , on 
tfie data bus of said target system, and 30 
a software performance analyzer connected to 
said processor, said software performance 
analyzer determining the time required to exe- 
cute said software between first and second 
locations based on the difference between first 35 
and second times recorded when first and sec- 
ond tags are captured from said data bus. 

3. The system of claim 1 wherein at least some of said 
tags have a tag type field corresponding to an anal- 40 
ysis function for which said tag is used, and wherein 
said processor includes means for processing said 
tags differently according to their respective analy- 
sis functions based on respective values in said tag 
type field. 45 

4. The system of claim 1 , wherein at least one of said 
tag statements is inserted in said software at a 
location that will cause said tag statement to be 
executed along with said memory allocation state- so 
ment, and wherein an executable data tag state- 
ment is inserted in said software at a location that 

will be executed along with said memory allocation 
statement, said data tag statement when executed, 
causing said target system to write a data tag to a 55 
. second predetermined location in the address 
space of said target system, said data tag contain- 
ing a data value indicative of the memory being 
allocated by said memory allocation statement, and 



wherein said system further includes a memory 
allocation analyzer connected to said processor, 
said memory allocation analyzer determining the 
memory allocation resulting from said memory allo- 
cation statement based on the data value of said 
data tag captured by said probe. 

5. The system of claim 4 wherein said memory alloca- 
tion analyzer further includes means for maintain- 
ing a running account of the memory allocated by 
said memory allocation statements based on the 
data values of respective data tags captured by 
said probe. 

6. The system of claim 5 wherein said memory alloca- 
tion analyzer further includes means for generating 
a graph in essentially real time of the amount of 
memory allocated based on said running account. 

7. The system of daim 1 wherein said tag statements 
are inserted in said software at locations causing 
respective tag statements to be executed along 
with a plurality of function call statements, and 
wherein said system further includes a call pair 
analyzer determining which functions of said soft- 
ware are linked to other functions of said source 
code based on the order in which said probe cap- 
tures said tags. 

8. The system of claim 7, wherein said call pair ana- 
lyzer further includes means for compiling a record 
of the number of times that spec'rfic called functions 
are called by specific calling functions. 

9. The system of claim 7, wherein said call pair ana- 
lyzer further includes means for compiling a statisti- 
cal record of the relatively frequency at which 
specific called functions are called by specific call- 
ing functions. 

10. The system of daim 1 wherein said tag statements 
are inserted in respective branches of said software 
so that said tag statements will be executed along 
with said branches, and wherein said system fur- 
ther includes a code coverage analyzer determin- 
ing which branches of said software have been 
executed based on the tag values of said tags cap- 
tured by said probe. 

11. The system of claim 1 further induding a tag filter 
• connected to said processor, said tag filter compris- 
ing: 

selections means for establishing criteria for 
processing said tags based on their respective 
tag values; 

comparator means for examining each tag cap- 
tured by said probe to determine if said tag 
meets said criteria; and 
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filter means for disregarding tags not meeting 
said criteria so that tags are processed by said 
processor only if said tags meet said criteria. 

12. The system of daim 1 wherein said probe further 5 
comprises: 

a probe tip connected to the address and data 
busses of said target system while said target 
• system is executing said software, said probe io 
detecting when said predetermined location in 
the address space of said target system is 
being addressed, said probe tip capturing a tag 
on the data bus of said target system when 
said probe tip detects that said predetermined is 
location has been addressed; 
a tag preprocessor connected to said probe tip, 
said tag preprocessor examining said tags and, 
based upon the contents of said tags, routing 
said tags to a first output if said tags are being so 
used for a code coverage analysis and routing . 
said tags to a second output If said tags are not 
being used for a code coverage analysis; • 
a code coverage data reduction processor and 
data base connected to the first output of said 2S 
tag preprocessor; 

a tag buffer connected to the second output of 
said tag pr^rocessor, said tag buffer storing 
and then outputting tags received from the sec- 
ond output of said tag preprocessor to said 3o 
processor so that said can determining the 
source code locations that have been executed 
based on the respective tag values of said tags. 

1 3. A method of analyzing software being executed in a 35 
target system having a data bus and an address 
bus, said method comprising: 

inserting a plurality of executable tag state- 
ments at locations in said software, each of 4o 
said tag statements, when executed, causing 
said target system to vyrite a tag to a predeter- 
mined location in the address space of said tar- 
get system, said tags containing respective tag 
values con-esponding the locations in said soft- 45 
ware of tag statements generating said tags; 
allowing said target system to execute said 
software; 

monitoring the address and data busses of said 
target system while said target system is exe- so 
cuting said software; 

detecting when said predetermined location in 
the address space of said target system is 
being addressed; 

capturing a tag on said data bus when address- 55 
ing of said predetermined location is detected; 
and 

determining the software locations that have 
been executed based on the respective tag val- 



ues of said captured tags. 

14. The method of claim 13 further Including the steps 
of: 

recording a first time when a first tag is present 
on said data bus; said first tag having a first tag 
value corresponding to a first location in said 
software of a first tag statement generating 
said first tag; 

recording a second time when a second tag is 
present on said data bus; said second tag hav- 
ing a second tag value corresponding to a sec- 
ond location in said software of a second tag 
statement generating said second tag; and 
tjased on the difference between said first and 
second times; determining the time required to 
execute said software between said first and 
second locations. 

15. The method of claim 13 wherein at least some of 
said tag statements, when executed, cause said 
target system to write a tag to a predetermined 
location in the address space of said target system 
having a tag type field corresponding to an analysis 
function of said tag so that different tags may be 
processed differently according to their respective 
analysis functions. 

16. The method of claim 13, wherein at least one of 
said tag statements is inserted in said software at a 
location that will cause said tag statement to be 
executed along with said memory allocation state- 
ment, and wherein said method further comprises: 

inserting an executable data tag statement in 
said software at a location that will be executed 
along with said memory allocation statement, 
said data tag statement, when executed, caus- 
ing said target system to write a data tag to a 
second predetermined location in the address 
space of said target system, said data tag con- 
taining a data value indicative of the memory 
being allocated by said memory allocation 
statement; 

detecting when said secorKl predetermined 
location in the address space of said target 
system is being addressed; 
capturing a data tag on said data bus when 
addressing of said second predetermined loca- 
tion Is detected; and 

determining the memory allocation resulting 
from said memory allocation statement based 
on the data value of said captured data tag. 

17. The method of claim 16 further including the step of 
placing data tag statements in software locations so 
that a data tag statement will be executed with the 
execution of each memory allocation statement. 
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and wherein, said method further includes the step 
of maintaining a running account of the memory 
allocated by said memory allocation statements 
based on. the data values of respective captured 
data tags. - 5 

18. The method of claim 1 7 further including the step of 
generating a graph in essentially real time of the 
amount of memory allocated based on said running 
account io 

19. The method of claim 13, further including the steps 
of: 

inserting tag statements in said software at is 
locations causing respective tag statements to 
be executed along with a plurality of function 
call statements; and 

based on the order in which said tags are cap- 
tured when addressing of said predetermined 20 
location is detected, determining which func- 
tions of said software are linked to other func- 
tions of said software. 

20. The method of claim 19, further including tiie step. 25 
of compiling a record of tiie number of times that 
specific called functions are called by specific call- 
ing functions, 

21. The metiiod of clairh 19, further including the step 30 
of compiling a statistical record of the relatively fre- 
quency at which specific called functions are called 

by specific calling functions. 

22. The method of claim 13, further including the steps 35 
of: 

inserting tag statements in respective branches 
of said software so that said tag statements will 
be executed along with said branches; and 40 
based on the tag values of said tags captured 
when addressing of said predetermined loca- 
tion is detected, determining which branches of 
said software have been executed. 

45 

23. The method of claim 13 further including the step 
of: 



including a data field having a tag value con-e* 
spending to the software location of a control tag 
statement generating said control tag, said data 
tags being associated with a respective control tag 
and having a data field that provides, information 
about an event identified by the control tag with 
which said data tag is associated. 

25. The method of claim 13 wherein said tags are gen- 
erated by application source code tag statements 
con'esponding to at least one of task creation, task 
deletion, task swap, function entry, and function 
exit, and wherein said method further includes the 
step of determining a task or function nesting hier- 
archy from a record of the source code locations 
that have been executed by said target system. 

26. The method of claim 25, further including the step 
of using said determining a task or function nesting 
hierarchy to control the progress of said analysis. 

27. The metiiod of claim 13 wherein said tags are gen- 
erated by RTOS event tag statements correspond- 
ing to at least one of task creation, task deletion, 
task swap, function entry, and function exit, and 
wherein said method further includes the step of 
determining a task or function nesting hierarchy 
from a record of said RTOS events. 

28. The method of claim 27, further including the step 
of using said determining a task or function nesting 
hierarchy to control the progress of said analysis. 

29. The method of claim 27 wherein said RTOS event 
tags are used to produce a task performance anal- 
ysis. 

30. The method of claim 13 wherein said tag state- 
ments are inserted in the source code of said soft- 
ware, and wherein said method further includes the 
steps of compiling said source code and inserted 
tag statements to obtain object code, and executing 
said object code in said target system- 



establishing criteria for processing said tags 
based on their respective tag values; so 
examining each captured tag to determine if 
said tag meets said criteria; and 
subsequently processing and displaying said 
tags only if said tags meet said criteria, thereby 
filtering said tags after said tags have been ss 
captured. 

24. The method of claim 13 wherein said tags may be 
either control tags or data tags, said control tags 
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